adpat move_intermediate_cache for sglang prefix + mtp by silencejade · Pull Request #422 · sgl-project/sgl-kernel-npu

silencejade · 2026-04-02T08:12:50Z

Dependence

gemini-code-assist

Code Review

This pull request decouples source and destination indices in the Mamba state update Triton kernel and its wrapper function, allowing for more flexible cache movement. Previously, a single index tensor was used for both source and destination. Feedback suggests strengthening input validation by asserting that all index tensors (dst_indices_tensor, src_indices_tensor, and last_steps_tensor) have matching lengths and ensuring they are contiguous int32 tensors to prevent potential out-of-bounds access or type mismatches in the Triton kernel.

python/sgl_kernel_npu/sgl_kernel_npu/mamba/mamba_state_update_triton.py

adpat move_intermediate_cache for sglang prefix + mtp

96d7bdf

gemini-code-assist bot reviewed Apr 2, 2026

View reviewed changes

python/sgl_kernel_npu/sgl_kernel_npu/mamba/mamba_state_update_triton.py Outdated Show resolved Hide resolved

fix test && add input check

2d2fb6c

silencejade mentioned this pull request Apr 3, 2026

[NPU] Adapt mtp + prefix for ascend gdn backend Ascend/sglang#202

Merged

5 tasks

RuixuanZhang06 approved these changes Apr 3, 2026

View reviewed changes

RuixuanZhang06 merged commit d16fb13 into sgl-project:main Apr 3, 2026
3 of 6 checks passed

silencejade deleted the br_fix_stateupdate branch April 7, 2026 11:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adpat move_intermediate_cache for sglang prefix + mtp#422

adpat move_intermediate_cache for sglang prefix + mtp#422
RuixuanZhang06 merged 2 commits intosgl-project:mainfrom
silencejade:br_fix_stateupdate

silencejade commented Apr 2, 2026 •

edited

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

silencejade commented Apr 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Dependence

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

silencejade commented Apr 2, 2026 •

edited

Loading